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Abstract— In this article, we consider a control loop where a 
scheduler collocated with the sensors decides when to transmit 
the measured output to the controller with the objective of 
saving valuable communication resources, while still meeting 
desirable closed-loop stability and performance requirements. 
The sensors provide, in general, only measured outputs of the 
system. The performance of the system is defined as the /2-gain 
between an exogenous disturbance input and a performance 
output of interest. When the scheduler triggers transmissions 
periodically, one can determine the controller that results in a 
given ¢2-gain bound along with conditions for its existence by 
following a game-theoretical approach. Based on these results, 
we tackle the problem of designing a scheduler-controller pair 
that guarantees the same /2-gain bound as the designed periodic 
time-triggered /2-controller, however, with a lower or at most 
the same average transmission rate from the sensors to the 
controller. We refer to this property as /2-consistency and thus 
we will provide such an /2-consistent controller-scheduler pair. 
Numerical examples show significant communication savings 
for typical disturbance inputs, while guaranteeing the same £2- 
gain bound for the control loop as the periodic solution. 


I INTRODUCTION 


Ensuring safe and reliable operation of control systems 
in the presence of unwanted disturbances is crucial in many 
control applications in areas such as automotive, aerospace, 
healthcare, power generation and many others. In many of 
these applications no probabilistic information about the 
disturbances is available and the system has to be designed 
to cope with all possible disturbances, including worst-case 
scenarios [1]. This usually leads to conservative designs 
and often a particularly large waste of computation and 
communication resources. A prime example is the choice 
of conservatively large sampling and transmission rates to 
ensure that signals flow rapidly among sensors, controllers 
and actuators so that the performance requirements are met 
for all possible disturbances; thereby the whole control 
infrastructure is configured based on worst-case scenarios, 
while it is intended for typical (common) scenarios. This 
calls for more resource-aware and more flexible computation 
and communication strategies that adapt based on the actual 
disturbance scenarios encountered. In this article we will 
focus on such designs in the context of @2-control. 

The @)-control (or Ha) framework has been widely used 
since the 90s in order to ensure that performance specifica- 
tions are met for all possible disturbances within a certain 
class. In fact, within this framework, considering a discrete- 
time linear system, with output measurement, additive dis- 
turbances and a performance output of interest, one can 
assert if, and design a controller such that, the ?2-attenuation 
level from the disturbances to the performance output is 





Fig. 1: The output feedback ¢-controller when communi- 
cation takes place over a resource-constraint communication 
network. G, K, S and N refer to the plant, the controller, the 
scheduler and the network, respectively. The state estimation 
in the scheduler is denoted by 7. 


bounded by a prescribed value for all squared summable 
disturbances. The infimum value of this attenuation level 
is called the /2-gain of the system. The problem that we 
tackle in this article, is to understand if by using a different 
communication scheme (other than periodic) one can provide 
the same guarantees, but with a smaller average transmission 
rate. In particular, we do this for the setup depicted in Fig. 1, 
where G, S, K and N represent the discrete-time plant, 
the transmission scheduler, the controller and the commu- 
nication network between the scheduler and the controller, 
respectively. The goal of the joint scheduling and 2-control 
policy is to attenuate the effect of the disturbance inputs w 
on the performance output z of the system. This problem 
can be positioned in the area of event-triggered control 
(ETC), see, e.g. the overviews [2], [3], which tackles the 
problem of designing appropriate (state or output-dependent) 
schedulers to arbitrate data transmission from the sensors to 
the controller. 

We investigated this problem in the framework of full-state 
feedback in [4], [5] and here we focus on the more chal- 
lenging framework of output feedback. We start by noticing 
that, when the scheduler triggers transmissions periodically, 
one can determine the controller that results in a given 
fy-gain bound along with conditions for its existence by 
following a game-theoretical approach. Using these results, 
we explore the design of an ETC, borrowing the notion 
of “f2-consistent ETC” from our previous works [4], [5]. 
This consistency concept refers to any ETC that preserves 
the same 9-gain bound as a periodic time-triggered £2- 
controller, however, with a lower or at most equal average 
transmission rate. The proposed event-triggered scheduling 
law in [5] depends explicitly on the disturbance input, which 
can only be calculated if the state vector is fully measured 
at all time steps. However, when the sensors only measure 


an output, rather than the full-state, then the calculation of 
the disturbance inputs is, in general, not possible. The main 
contribution of this work is to propose an f2-consistent ETC 
for this challenging output feedback case. 

Related work: In the ETC literature one can find a consid- 
erable amount of research that considers an average quadratic 
cost, which can jointly capture the effects of the average 
tracking errors and the control efforts as a performance 
index. Finding the optimal joint event-triggered scheduler 
and controller that minimize such a performance index, 
related to Hə control, is one of the problems of interest 
in the area [6], [7]. However, since this problem is in 
general hard, the research in this domain is inclined mainly 
toward the design of ETCs, which outperform periodic 
time-triggered controls (PTCs) in the sense of an average 
quadratic cost [8]-[11]. An ETC which can guarantee a better 
trade-off between the average quadratic performance and 
the average transmission rate in comparison to the optimal 
PTC is called consistent [9]. Researchers also investigate 
the design of ETCs in the context of £.-gain and £p- 
gain in recent years, see, e.g., [12]-[14]. As an example, 
a class of dynamic ETCs is introduced in [13] that provides 
a trade-off between inter-event times, maximum allowable 
delays and £,-gains. Furthermore, the design of ETCs from 
an l2 (or £2)-gain perspective has received a great deal of 
attention throughout the past decade, see, e.g., [15]-[19]. In 
most of these studies a Lyapunov-based approach is followed 
to design ETCs, which can provide a guaranteed bound 
for the 42 (or £2)-gain of the system. However, a clear 
comparison between the average transmission rate of the 
proposed ETC and its periodic time-triggered ¢2-controller 
counterpart, which guarantees the same 2-gain bound for the 
closed-loop system, is often absent. Therefore, we extend the 
notion of consistency for the event-triggered ¢2-controllers, 
which simultaneously takes into account an f)-gain bound 
of the control loop and the average data transmission rate 
from the sensors to the controller. 

We are just aware of one other research paper [20], where 
the proposed ETC has the same consistency property as we 
define in the present work. Both [20] and [11] build upon 
the results in [21] which proposes a constructive algorithm to 
redesign optimal Hə and Hə controllers, such that sampled- 
data solutions are still optimal under any sampling pattern. 
This approach is radically different in the Hə setting [11] 
from the approach in the mentioned works that rely on 
dynamic programming [6], [9], [10] . In particular it does 
not take into account the information that is present when 
a transmission does not occur [6]. Our approach is in the 
spirit of [22] where the min-max algorithm replaces dy- 
namic programming to account for worst-case disturbances. 
Since [20] is presented in continuous-time and follows a 
radically different approach, connections between the results 
are hard to establish. Both works are therefore of independent 
interest. 

Organization: The remainder of this paper is organized 
as follows. Firstly, a periodic time-triggered ¢-controller is 
designed in Section II, where the problem of interest is also 


introduced. An £2-consistent ETC is proposed in Section MI, 
which is the main contribution of this work. The effectiveness 
of the proposed ETC in decreasing the communication rate 
is demonstrated through a numerical example in Section IV. 
Section V presents some concluding remarks. The proofs of 
lemmas and theorems can be found in the appendix. 
Notation. For 7,s5€No:=NU{0}, we define 
Ns = {t € Noir < t < s} and &,., as the Hilbert 
space of sequences W,:5 := {wiens where w, €R? 
for all t € No. The @9-norm of Wrs € liig is given by 


lwnsle = VE ul, where (wi? = ww: We 
also define |w: := wi Qw: for every square matrix 


Q with suitable dimension. Moreover, ¢¢ is the Hilbert 


space of square summable sequences w = {w}tcNo 
with norm = |lwl|z, = 4/X ren lwll? A function 


f: R” > RU {—00} is said to be upper semicontinuous if 
f (xo) > limsup,_,,, f(a) at all xo € R”. 


II PROBLEM FORMULATION 


Section II-A introduces the problem setting with periodic 
communication and Sections II-C, II-D provide controllers 
that guarantee an @2-gain bound with periodic communica- 
tion, relying on the notion of information state, introduced in 
Section II-B. In Section II-E, we introduce the structure of 
the NCS, when the transmissions follow an event-triggered 
policy instead of a periodic one. Finally, we present the 
problem of interest in Section H-F. Throughout the article, 
the most important equations of each Section, Lemma or 
Definition are highlighted. This can help readers to quickly 
grasps the main messages. 


II-A Networked control system with periodic communica- 
tion 
Consider the system architecture in Fig. | in which the 
plant G is given by a discrete-time linear time-invariant (LTD 
system 
tt41 = Ate + Biwi + Bout, 
a = Cir, + Diour, (1) 
Yt = Cox, + Dove, 





where x, € R”, y € R°, % € RP, us € R™ and w, € RI 
are the state, the measured output, the performance out- 
put, the control input and the disturbance, respectively, at 
discrete time t € No. A, Bı, Bo, Ci, Di2, Co, and Do; 
are real matrices of appropriate dimensions and w € £4. 
Moreover, we define 6, € {0,1} to indicate whether com- 
munication takes place from sensors to controller at time 
t € No (6; = 1) or not (6; = 0). If 6; = 1, all the measured 
outputs (YL; 1415 YLı—1+2; -- - , Yt) after the last transmission 
time (before t) L;_; := max{@ € Nf '|őp = 1} up to (and 
including) the current time step t are transmitted. When the 
scheduler follows a periodic transmission policy with a given 
time period T € N, we set 6, = n7 at all t € No, where 


- 1, iftis zero or an integer multiple of 7 
T= ; (2) 
0, otherwise. 


Then, any periodic control policy can be formulated as 
us = RE (Fe); (3) 
for a mapping Rg” : F;"7 —> R™, where 
Fr” = {yili €No'} (4) 


is the information set available for the controller at every 
t € No. We need the definition of global asymptotic stability 
in the sequel. 

Definition 1: (Global asymptotic stability) The system (1) 
with w = (0,0,...) and a given controller is said to be 
globally asymptotically stable (at equilibrium point x, = 0), 
if: 

i) it is Lyapunov stable, i.e., for every Ç > 0, there exists 
an € > 0 such that for all initial states x) € R” with 
||xo|| < €, it holds for the corresponding state trajectory 
{xien that ||x:l| < ¢ for every t € No, 

ii) itis globally attractive, i.e., for all initial state xo € R”, 
the corresponding state trajectory {£+ }reno converges 
to x. = 0 as time goes to infinity, i.e., limy... £t = 0. 














Now we formally define the concept of a 7-periodic £2- 
controller for system (1). 

Definition 2: (t-periodic £2-controller) 
Given y € Ryo and 7 EN, a periodic control policy 
Rit: Ff" —> R”, t € No, for the system (1), where Fp” 
is given in (4), is referred to as a 7-periodic ¢-controller 
with l2-gain bound y [23], if: 


i) the control loop given by (1) and (3) is globally 
asymptotically stable, and 





ii) there exists a function ¢:R" >R, 

satisfying (0) = 0, such that for 

all we tg and any zo E R”, 
J:= |lzllé, — Plwl?, <) © 











The infimum value of y € R,o for which a r-periodic £2- 
controller exists with £2-gain bound y is called the infimal £2- 
gain of (1), and is denoted by y=. Moreover, the problem of 
finding a 7-periodic @2-controller with a given ¢2-gain bound 
is called the £j-control problem. 














II-B Information state 


In this section we recall the definition of the information 
state [24] for system (1). Then we determine the dynamics of 
the information state in Lemma | for a given initial condition. 
Finally, we mention how to solve the 7-periodic 2-control 
problem based on the information state. 

Definition 3: (Information state, [23, Eq. 4.7]) Let an 
upper semicontinuous function po : R” + RU {—oo} be 
given. Then for given uo-4—1 and yo-4-1, the information 
state pz(xz) € RU {—co} at every t € N for the system (1) 


is defined as 


ti 
pete) := max max COPE 


wo:t—1EL3 94-1 TOE i=0 

Ti+1 = Ax; + Bıwi + Bou, 
zi = Cixi + Diozu, 
Yi = Coa, + Dawi, 


where for all 7 € Nes : 





(6) 














The information state in Definition 3 is known as the 
largest value of a cost to arrive to a specific state x, at 
a specific time step t € N or as a sufficient statistic needed 
for the f2-controller design. In fact, it represents an upper 
bound of the cost function (p9 (ao) +Y% illl? — |] will?) 
at every time step, given that all the control inputs wo-;—1 and 
all the measured outputs yo.4—-1 are known up to the previous 
time step. Furthermore, it can be perceived as a bounded real 
lemma in state estimation, where Vi(x+) := —p:(x+) for all 
t € No, then based on (6) we can easily show that 


Vo(zo) > Valae) + Xizollzill? — lwl. 


In the next lemma, we determine the dynamics of the 
information state by solving (6) for a quadratic po. This 
becomes the basis of the t-periodic l2-controller design in 
the next section. To simplify the notation, let us first define 


Q = CTC, R:= DI,Dria, P= CT D12, L:= Bı D}, 
N := Da D}, Q:=Q-PR PI, A:=A-—LN'Cy, 
Ã:= A- BRP, A=A-—LN7!C)— BR IPI. 


We need the following assumptions in the sequel. 
Assumption l: It holds that 
i) (A, B2) is stabilizable, (C2, A) is detectable, 
ii) Do, is full row rank, (A, B,—LN~'Dzg1) is controllable, 
iii) D2 is full column rank, and (Cı — Dy2R7'PT, A) is 
observable. 
Assumption 1(i), D2, being full row rank and D12 being 
full column rank are standard assumptions in the literature 
(see, e.g., [22]). The second part part of Assumption | (ii) 
is further discussed in Remark 1 and Assumption I (iii) is 
an analogous condition to a condition provided in [22, Lem 
3.5] to establish existence of solutions to infinite-horizon 
Riccati equations. These are technical assumptions, which 
we introduce as sufficient conditions to prove our results. 
These conditions might introduce some convervativeness to 
the class of system our results apply to. 
Lemma 1: (State-information for system (1)) Suppose that 
Assumption 1 holds and for a given y € Ryo and a pos- 














itive definite matrix Yo, let po(ao) := —7"||olly—1 for 
all zo € R”. 

i) The information state in Definition 3 is 

bounded at every teN and is given by 








pelz) = —7° l£: — zll- th, (7) 
under the condition that 5; be positive definite for all 
t € No, where 


Si := Y; + CINTO —477Q, (8) 





in which 
Y1 = ÂS 7 'ÂT + B,BT-LN-1L™ (9) 
for all t € No. Moreover, the vector z; and the 


scalar Z; are given for all t € No by 


Tiq1 = Apt, + Bet, + Dit, (10) 


Za = Ze + lz, + all, -lalh aD 


with Zo = 0 and Zo = 0, where 

A; = Äl; ', Tl, =I —-?¥,Q, 
Bı = Ba +P ÂS P, S=S: +7 7(Q-Q), 
D, = AS CIN! + LV, Qi = QI; +, 

Ri = R+ 7~-?P1S;'P, 

V, = N-}(I — C287 'CIN-!), 


i (12) 
assuming II, has no zero eigenvalue, with 
Ue = u — (te +t), Ye=Ye—Ye, (13) 

for 

w=—-Kim, t= Kit, fe = Lik, (14) 
and 
K,=R P'S, CIN“, K,.=R P'S, Y}, 
Li = Coll. 


ii 


wa 


If Sp determined based on (8) and (9) has a negative 
eigenvalue at a given k € No, then p,(a,) determined 
based on (6) is unbounded, i.e. +00, for all t € N>k+1.- 














Remark I: Based on (7), the calculation of the informa- 
tion state at every time-step requires Y; to be invertible. It 
is worth mentioning that at every t € N, S;_; > O and the 
controllability of (A, B,—LN~1'Dz,), by Assumption 1(ii), 
are sufficient conditions to guarantee that Y, determined 
based on (9) is positive definite (see the proof of Lemma 1). 














Remark 2: We do not cover the case in which S% has one 
or multiple zero eigenvalues at a given k € No, see Remark 3 
below, for a more detailed discussions. 

Note that based on (10) and (11), the values of gą 
and Z, can always be determined if the information set 
{yi uili € No} is available at every t € N. In order to 
determine a 7-periodic £2-controller, we first consider the fol- 
lowing minimax optimization problem over a finite-horizon 
T: 


J* = MaX{y,=T:(Es)} 














T-1 
tENo 


. . (15) 
MUM u =R (Fe bent 2 J(Yo:T-1, U0:T-1); 
0 
where 
I lyo:r-1, uo:r-1) := max [pr(er) + q(xr)], 
xr ER” 
for some mapping Ti : E+ > R°, t € No, 
E := {yili E N`}, (16) 


at every t € No, and 


q(xr) = MINDY), -=M (eK )kensp 


MAXw,:=N (ep )eeny 7 (Re ellzell? E ¥||well?) 


is the value function of the full-state feedback zero-sum 
dynamic game in which M and M are appropriate mappings. 
Moreover, pr (xr) follows (7) determined in Lemma 1. Note 
that J(yo.r-1; uo:r—1) is independent of the disturbance 
inputs since both pr(xr) and gr(x7r) are independent from 
the disturbance inputs. The control policy determined by the 
solution of (15) can be shown to solve a output feedback £2- 
control problem over the finite-horizon [25]. After we de- 
termine such a controller by solving (15), we will find the 
conditions under which the determined policy can be the 
solution of the (infinite-horizon) 7-periodic ¢2-controller as 
introduced in Definition 2. This is carried out in the following 
subsections. 


(17) 


I-C 1-Periodic ¢-controller 


The next lemma gives sufficient conditions for the exis- 
tence of a 1-periodic ¢2-controller for a given y € Ryo, and 
provides a 1-periodic £2-controller with €2-gain bound +. The 
conditions are given in terms of positive-definiteness of given 
matrices. The lemma also states a converse result, showing 
that these conditions are strict. Namely if one of these 
matrices has a negative eigenvalue then no 1-periodic £2- 
controller for the given y € R,o exists. Recall that yj is 
called the infimal @j-gain of (1) based on Definition 2. The 
proof can be found in the appendix and relies on the solution 
to (15). 














Lemma 2: (1-Periodic ¢3-controller) Let As- 
sumption 1 hold and consider a given y€ Ryo. 
Suppose the following conditions hold 

Ay :=7I-YX>0, As:=S>0, (18) 
A; := 7°I — B] XB, > 0, 
where X and Y are the positive definite solutions of 
X=ATXH'A+Q, (19) 
Y = AS-! AT + B, BI — LN“L7, (20) 
for 
H=I1+(BoR'BI —y 7B, BI)X, on 


S=Y 1 +0IN tC- yQ. 
Then, y > yï and 
i) the control policy 








už = -Rİ P| z! — K(Ăz! + LN ty), (22) 
is a I-periodic @2-controller with /2-gain bound y 
based on Definition 2, where 


K :=R71(BI + y7? PT STH ĂT)G, 
2 





(23) 
for 
=Y + CINC -77Q, 


G = (HX! +y? LNT!L — y7’ ÄST! ÄT), 
(24) 


and 2 
BY := SHY tt, + CIN y), (25) 

with the state estimator 

Tt+1 = Az; + Blur — Ut = tit) + Dy | Ut), (26) 
for all t € No, with Zo = 0, 

A= Alr!, B= Ba + y7? ÂST!P, 

W=I-y "YQ, D = ASON! + LV, 

V=N'(1-C,8-'CIN7}), 


and 

ùt = —Ky,, a, = —Ki,, hi = Le, (27) 
for 
K=R™'PtSCIN, K =R PITY, 
L=c,1Ir!. 


ii 


wm 


the performance index J in (5) is upper bounded b 





oO 

F< lolly +) lu — uilla — lye — ve lla, 
t=0 

(28) 





or 
Up := (C2 + qh XH AA ah, 


and having R := R +77? PTS71P, then 


ð := R+ BIXA;'B, YV :=7°V — DIGD. 

(30) 

Moreover, if for a given y € Ro, one of the matrices 
Ai, A2 or Ag in (18) has a negative eigenvalue, then 
yi: 
Remark 3: We do not cover the case where for a given y € 
Rs 9, one of the matrices A1, Az or Ag has one or multiple 
zero eigenvalues. Note that typically, the set of y € R for 
which this is the case, has zero measure. Therefore, from 
a practical point of view this is not a severe limitation of 
our work. It is worth mentioning this is the only reason we 
cannot call the conditions in (18) necessary and sufficient for 
the existence of 1-periodic 2-controller. 
Remark 4: Under Assumption 1, the solutions of the sta- 
tionary Riccati equations (19) and (20) for a given y € R,o 
should be positive definite so that the control policy (22) 
solves the 1-periodic £2-control problem. In fact, X is needed 
to be positive definite even for the state-feedback ¢2-control 
problem, see, e.g., [22]. Moreover, based on Remark 1, under 
the condition that Ay > 0, Y is positive definite. 


(29) 






































II-D 1t-Periodic ¢2-controller fora r € N 


This section focuses on a multi-rate periodic ¢2-control 
problem in which the controller has access to the measured 
outputs only every 7 time-steps, but then also gets all 
intermediate/previous values. Recall that y* is called the 
infimal @)-gain of (1) based on Definition 2. Note that the 
output policy that is determined based on (15) depends on 
all the measured outputs from the initial up to the previ- 
ous time-step, i.e., up to t — 1. We introduce another time 


variable x € No, where « = |Ż] and define the augmented 


control inputs and measured outputs at every k as U, = 
lul,- -ul },—1]7 and Y, = [y]; ---yl-}r—1]7. Lemma 3 
presents a T-periodic 2-controller for a general r € N based 
on the solution of (15). 
Lemma 3: (t-Periodic f3-controller for TEN) 
Let Assumption 1 hold and consider a 
iven y € Ryo. Suppose the following inequalities hold 





A,;>0, Ag>0, ^A3>0, and Yh ENS, 
A24 :=7°I— (AIL tY A! +BB} )@p > 0, 





(31) 
for I = I — y “YQ, where Aj 9.3 are given in (18), X, Y 
are the positive definite solutions of (19), (20), respectively, 
and 





ATOnsiV, A if h e N77! 
Op := h+1’pn + Q, i EN, (32) 
X, if h =7, 
where V,_,:=I—7~?B,B]Oni1, VheEN7'. Then, 
y> y; and 


i) the control policy 





m = 

—R-' pl zt — K(Art + DN 1%), if a7 = 1 

otherwise 

(33) 

is a T-periodic l2-controller with @2-gain bound y 

based on Definition 2, where mj, K, z and x 

are given in (2), (23), (25), and (26), respectively. 
Moreover, 


-R!(B] XH-1A+ P")au:, 


4 











Sa = H! Ae, VEEN 

-J X1G(Art, + LN tyi), if m7, =1, 

ee otherwise, 
(34) 

where z¥ and G follow (25) and (24), respectively. 

the performance index J in (5) is upper bounded b 


TtHt-1> 


ii 


wa 





I< zoll- +X Un —USS, — Ye -Ye R, 
K=0 











(35) 
where for every k € No, 
Un = abt geal YR i= W er aeal 
(36) 
in which už follows (33) for all t € No, 
ye = (Coty VL XH A)Al Ser, 67) 


and for all i € N77’, 


Ue = Unt T Colt (erhi T y ?’Y Pusri), 
(38) 
where 
Tirpi = V P(C tY AT + LT) 
(Oi V Aier pi + Eiti AiÛkrti), 


i ich U fut T 
in which Upri := (Urpi e Usear_aill 


noi i ifi=r-, 

o |[B ©74,N},,], otherwise, 

for all i € NT, where Ê := By + y~2AIl~!YP, 
By := (V1; — 7 AMY AT), 
Q := (I-y?Y@,)7}," 





and having Ñ, := BIO,V_',A+ PT, then 


z BIO; 1V7 (A+ PT 
Nye [eae , WeNt~*, 
| Niyi V'A : 


Moreover, Y, := diag(Wo,..., U;_1) such that 


— xV — DIGD, if i = 0, 
aa yV — DTO 141D, otherwise, 

for all i € Nj", G in (24) and 

Ý := NHI — 6s CIN), 

D := AS-'CIN-1 + LV. 
Furthermore, 

r es R+ BTOB BOLNI 

7 NıQı B Tı , 
where for all i € Ni~?, 
T; := Al Eip A; + diag(R + 7° PTI HY P, 


\ -1 OT T 
Tipi g Niri 97 Qla Nia) 





such that Y-_) := R + TZ, Ê + y7? PTI tY P. 
Moreover, if for a given y € R.o, one of the matrices A;, 
for i € Nat? in (31) has a negative eigenvalue, then y < y=. 














Similar to Remark 3, we do not investigate the situation 
in which for a specific y € Rs 9, one of the matrices Aj, for 
iE Mee has one or multiple zero eigenvalues. 

Based on (33) and (34), the controller just needs to receive 
{%z, y+} at every t = Tw in order to calculate the control input 
at every t € No. Therefore, for the proposed r-periodic £2- 
controller, the measured outputs after the last transmission 
time do not need to be transmitted to the controller. 


II-E Networked control system for event-triggered commu- 
nication 


The NCS we are interested in the ETC setting has the same 
plant G as given in (1). However, data transmission from the 
sensors to the controller are now triggered by an output- 
dependent mechanism, which is called an event-triggered 
transmission policy. This event-triggered transmission policy 
can be formalized as u = {fi }ten,, where 


ôt = (Ht) € {0,1} (39) 


with 


Hi := {yili E Nb} U {ili € Nit} (40) 


is the information set available for the scheduler at every 
t € No to decide whether to transmit (6; = 1) or not to 
transmit (6, = 0) at time t. Then, any appropriate control 
policy is formalized as R” = {Ri Jeno, where 


uy = RE (FF) (41) 


for a mapping Ri’: F! + R™. The information set F” 
available for the controller contains all the measured outputs 
up to the last triggering time. However, based on the periodic 
time-triggered control policies in (22) and (33), Z, at every 
t EN is a sufficient representative of all the measured 
outputs up to the time step t — 1 needed by the controller. 
This indicates that at every triggering time step i € No, 
instead of {y_,_,+41,---, yi}, the scheduler only needs to 


transmit {Z;,y;} to the controller. Accordingly, 
FE = {2i yili € No A mi (Hi) = 1}, (42) 


where g; is determined based on (26) at every i € No 


in the scheduler. We call a combined event- 
triggered scheduler and controller 1 = (u, R”) 
an ETC. Furthermore, we introduce the average 


transmission rate associated with an ETC policy 7 as 
fn(xo, w) = lim supro F Ea ht(H:) and the average 
inter-transmission time as Gy (to, w) = 1/f, (zo, w). 
Note that this depends on the initial state x9 and the 
disturbance realization w. Next, we define the concept of 
an event-triggered £-controller. 

Definition 4: (Event-triggered —_€-controller) Given 
y € Ryo, an ETC ņ = (u, R“) for the system (1) is referred 
to as an event-triggered f-controller with @2-gain bound y, 
if 

i) the closed-loop control system is globally asymptoti- 

cally stable, and 

ii) there exists a 

satisfying (0) = 0, 
any zo € R”, (5) holds. 


II-F Problem statement 


function 
such that 


@:R" > Ryo 
for all we é%, 














We are interested in decreasing the communication fre- 
quency between the sensors and the controller by employing 
the idea of ETC, while at the same time preserving the 
same ¢)-gain bound for the control loop as the designed 7- 
periodic ¢2-controllers in (22) and (33). This concept is 
introduced in the following definition as an ¢-consistent 
ETC. 

Definition 5: (€2-consistent  event-triggered controller) 
For any given 7 € N and any ¢2-gain bound y € R,9 for the 
system (1), where the 7-periodic @)-controller (33) (or equiv- 
alently (22), in case T = 1) exists, an event-triggered l2- 
controller n = (u, R#) is said to be f2-consistent with Z2- 
gain bound y, if 

i) 7 has €-gain bound y, and 

ii) in comparison to the 7-periodic ¢2-controller (33) (or 

equivalently (22), in case T = 1) with @5-gain bound 
y, the average transmission rate of 7 is lower than or 
at most equal to 1, i.e., fn (zo, w) < 4 for all xo and 
all w € £9. 














Based on Definition 5, the goal of this work is to propose 
an £9-consistent ETC for the NCS depicted in Fig. 1. 


III MAIN RESULTS: £2-CONSISTENT EVENT-TRIGGERED 
CONTROLLER 


For simplicity, in Section III-A first we propose an £2- 
consistent ETC based on Definition 5 for 7 = 1. Then, in 
Section III-B, we consider the general case. 


III-A Special case T = 1 


An ETC consists of two main parts being the controller 
and the scheduler. Inspired by the 1-periodic ¢-controller 
in (22), we propose the controller of our £2-consistent ETC as 





RE(FE) = -RT P! sh, — K (Àt + LNG), 
(43) 











where K is given as in (23), 


The = SY lay: + CIN), (44) 


and Yt, Ti are the estimation of y; and 7, in the controller, 
respectively, at every t € No. When the scheduler triggers 
a transmission, i.e., 0, = (Ht) = 1, {Zt yz} is commu- 
nicated to the controller and, therefore, Yt = Yt, Typ = Zt. 
However, in case of no data transmission, we use the hold 
policy for the estimation of y; in the controller. Therefore, 
having y_; = 0, for all t € No, 


= Yt, 
Yer) ~ 
an 


The main reason for not employing a model-based estimation 
for y4 is due to the fact that within the context of the 
fy-control problem, the characteristics of the disturbance 
input w is usually unknown, where it can take any bounded 
value. Now for ,),, we introduce the following model-based 
estimator, 


f (45) 
otherwise. 


iyi = Afije + Blu + KG + Ky) + Dj — Lie), 


Bs pow i if uH) =1 
Ttt =) ~ 
Ttjt—1> 
(46) 


otherwise, 
where ojo = 0 and y% follows (45) for all t € No. Next, 
we define the event-triggered transmission policy jt as 


l, if =0 
pl (He = f Piae 


or all £ € No, where 





(47) 
otherwise, 








Bi+aijt = Bayt — lye — yi llo + |u? — ut Il, 


J Bae- Wye yilli tf uH) = 1 
F1|t-+1 ‘= ; 
Bt+ilto otherwise, 


for all ¢ € No, Bojo := 0 and 





Be 





up = — RO Pag = K (Aiei PLN h), 
where 


Ži = SHY Zae- + CINT). (48) 


Moreover, for all t € No, Jt—1, Žijt—1, U% and y; follow (45), 
(46), (22) and (29), respectively. The next theorem states 
the €2-consistency of the proposed ETC, i.e. (43) and (47). 
Theorem 1: (€2-consistency of the ETC for T = 1) Let 
Assumption | hold. Then for system (1), the ETC with 
the scheduling policy (47) and the control policy (43), 
where Yt, Tae follow (45) and (46), respectively, is la- 
consistent in the sense of Definition 5. 
The main idea behind the proposed ¢2-consistent ETC 
comes directly from the upper bound of J in (28). As the 
measured outputs deviate from y; at some t € No, it results 
in some negative terms, which provides the opportunity to 
skip data transmissions at some times. In principle, the 
proposed ETC balances the negative terms of cost upper 
bound arising when y: Æ yf, with the positive terms of 
the cost upper bound, when us # už due to skipped data 
transmissions, such that still J < 7*||xo||?--1 holds. 














II-B General case r € N 


Similar to the previous section, we first introduce the 
control input of our desired ETC. Based on the periodic £2- 
controller (33), where the state estimation z; and the cur- 
rent measured output y; are transmitted to the controller 
every T € N time-steps, the control input of the ETC is 
proposed as 








Ri (FF) = 
-RPT aN, — KAY, + LN), if af =1, 
—R-1(B] XH'A + P!)kue, otherwise, 
(49) 





at every t € No, where 77, Tije Yt, Ttt Follow (2), (44), 4 
and (46), respectively, and for every t € N, 


Lerale = Ho Ady, 
ž fas +IN§-1), if my =1, 
tlt = 














Ett-15 otherwise. 
(50) 
Then, the event-triggered policy is proposed to be 
1, if mF = 1 and Bessie > 0 
m (Hi) := 3 eea (51) 
0, otherwise, 
for | t € No, where consider- 
ing UV, = diag(W,,...,V,_1, Vo), 
Ue. .¢ Ykr—-r+1 7 Yer—r41 
UP = aTe = ex ; 
ARTE Yar Yir 
for all « € N, where 
uf = 
-RP ije — K (Ay, +LN tfe), if x7 =1, 
-R!(BI XH! Ä + P) 44-1, otherwise, 
(52) 


where x" 


t\t—1 follows (48), 


i X'G(Agh, +LN tj), if a7 =1, 
ttie H! Ady. -1, otherwise, 
(53) 


Bij = —lyoll%, and for all « € No, 
Bw41)r|ar = Btls -= ||Yx — Yla, O= Ullah 


B _ Basico if eG 1 
(K+1)T|(K+1)7 ° Becti)r|nr» otherwise, 





in which Prckivrler = Pirie = |. z Yells. and Lt\t—-1> 
Už, Y%r—r+1:xr follow (46), (36) and (37), (38), respectively. 

Theorem 2: (€2-consistency of the ETC for r € N) Let 
Assumption | hold. Then for system (1), the ETC with 
the event-triggered scheduling policy (51) and the control 
input (49), where Yt, Tig follow (45) and (46), respectively, 
is €9-consistent in the sense of Definition 5. 

Hence, also in the general case 2-consistent ETC can be 
designed systematically. 














IV NUMERICAL EXAMPLE 


Consider system (1) with 


0.8 —0.5 0.1 0 —0.2 
a ce a a= i al Ba = Fat 


0.1 0.2 0.3 
AS Ee 0 > P= a i 
Cə = [0.5 —0.1], Da = [0.5 1], 


where clearly Assumption 1 holds. Moreover, we take 
we = E7200 [sin( $) cos(5)]", t € No, as the unknown 
disturbance input of the system and zg = 0. We denote 
the infimal @-gain bound for which the 7-periodic ¢ - 
controller (33) exists by y= as before. For the given sys- 
tem, these values are determined by a bisection method to 
be yï = 2.9550, 43 = 3.8055, 73 = 5.1140, yz = 6.8489, 
see Fig. 3. We will show now that the proposed £2-consistent 
ETC (based on Theorems 1 and 2) can result in significant 
improvements both in terms of the @:-gain bound and the 
communication rate. 

To do so, consider Fig. 2 in which the state trajectories 
of two different closed-loop scenarios over a 250 time step 
window are shown. In the first, we display the PTC scenario, 
in which the scheduler is transmitting periodically with r = 4 
and the controller follows (33) for y = 6.8510 = 77 + € 
close to the infimal value yj as e= 0.0021 is a small 
real number. The second scenario is an /)-consistent ETC 
designed for (7,7) = (1,3.0504) based on (43) and (47) 
(note y > yj). The average transmission time period of this 
event-triggered scenario is W,(to,w) = 4.032 ~ T = 4 of 
the PTC. The comparison of these two scenarios indicates 
that it is possible to guarantee lower @2-gain bounds for the 
system (1) by employing the proposed f2-consistent ETC in 
comparison to the periodic ¢2-controller in Lemmas 2 and 3, 
while they both have almost the same average transmission 
rate from the scheduler to the controller for the corresponding 








ETC (7 =1 , y =3.050< yf), © (xo, w)=4.098 
— — — -PTC (r =4 y =6.851= 77 + €) 
@ = Triggering times of the event-triggered scheduler 
































Time 


Fig. 2: Illustration of the higher disturbance attenuation 
(lower state trajectory amplitude) of the proposed ETC 
designed for (7, y) = (1,3.050) in comparison to the PTC 
designed for (T, y) = (4, 74 + € = 6.851) while it needs the 
same average transmission rate (for the given disturbance 
input w). 
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Average transmission frequency (1/7, fa(w)) 
Fig. 3: Trade-off curves that result from the provided Z2- 
consistent ETC designed for 7 = 1 and 7 = 2 and the given 


disturbance input w in comparison to the one that can be 
achieved by PTC. 


disturbance input. The better disturbance attenuation of the 
lə-consistent ETC can be easily seen in Fig. 2 and is 
guaranteed by our theory. 


Fig. 3 shows the trade-off curves between the ¢-gain 
bound and the average transmission rate from the scheduler 
to the controller both for the proposed @-consistent ETC 
and the designed periodic time-triggered f-controller. The 
stair-type curve indicates the trade-off values one can achieve 
by following a periodic ¢-controller. However, the dotted 
blue and the dashed red lines indicate the trade-off values 


resulting from the provided ¢2-consistent ETC for 7 = 1 
and T = 2, respectively, for the given disturbance input and 
a time horizon of 250 and zero initial condition. As it can be 
seen, the proposed 2-consistent ETC can result in the same 
Lə performance level as the periodic ¢2-controller, however, 
with a much lower average transmission frequency. In other 
words, Fig. 3 shows that the proposed ¢2-consistent ETC 
can result in much better trade-off values between the £2- 
gain bound and the average transmission rate in comparison 
to PTC, thereby demonstrating the strengths of the proposed 
ETC algorithms. 


V CONCLUSIONS 


In this article, we first followed a game-theoretical ap- 
proach to design a periodic time-triggered ¢2-controller for 
a given f)-gain bound y and a discrete-time linear system 
with output feedback. The concept of information state [24] 
was employed throughout this controller design to define 
a minimax optimization problem, where the controller acts 
as a minimizer and the measured output is the maximizer 
of the cost function. Based on this setup, we proposed 
an event-triggered ¢2-controller, which guarantees the same 
fy-gain bound as the designed periodic time-triggered £2- 
controller, however, resulting in a lower or at most the 
same average data transmission rate from the sensors to the 
controller in comparison to that of the designed periodic 
time-triggered ¢2-controller. A numerical example showed 
that the proposed policy can, in fact, save considerable 
amount of communications in comparison to its periodic 
time-triggered 2-controller counterpart, while guaranteeing 
the same level of @j-gain bound for the control loop. 


APPENDIX 


A Proof of Lemma 1 


The proof of this lemma is provided as complementary 
material due to the space limitation. 


B Proof of Lemma 2 


First we solve the finite-horizon zero-sum quadratic dy- 
namic game (15), then we discuss the conditions under 
which the stationary version of the determined control pol- 
icy is the solution of the 1-periodic 2-control problem 
according to Definition 2. First of all, we can easily show 
that q(£r) = x}.Xaxr, where X follows (19) [22]. Now to 
solve (15), we follow a dynamic programming approach. 
Therefore, we should determine a value function at ev- 
ery t E€ NTI, i.e., Wi : R” — R, which satisfies 


min 
wea {u= Rp (Fp )} 


W, (ē:) = 
t (z+) t=T EO] 


kEN kengt 


max [pr(xr) + £hX rr], 
srER” 
(A.1) 


for r = 1, where z, follows (10) and 

Wr(īr) := maxeper[pr(er) + ch Xer] = maxz per» 
[-y(ar — Br) ¥p (ar — Er) + Zr + ahXa7] 

= maxepern[—-Yep(Yp -V° X)er +24 Yr Er] 
-42 Yr Er + Zr =P EY p (Yr -y °X) Yp Er 
— PalYp Er + Zp = TLX (I -y ° YrX) ler + A 








under the condition that 


Qr := (I — y °Yr X)! > 0. (A.3) 


Note that if Qr has a negative eigenvalue, then Wr (gr) is 
unbounded (necessity of the condition). We can also show 
that for any linear system, when the controller has access to 
the output vector at every t € Ne then 

Wilt) = TI Xiu + Ze, (A.4) 


at all t € NJ, where Q := (I — y~Y,X;)7!, X is a pos- 
itive definite matrix, which will be determined based on an 
iterative Riccati equation in the sequel, and Y;, Z; follow (9) 
and (11), respectively. Now for an arbitrary i € Nios 
AW; : = Wi41(Gi41) — Wi (ti) = Fh Xi Qiii 
+ Ži+yı - ZI XQ — Zi = a! Qik: + ul Rit; 
-77 
We now substitute Z;,, based on (10) in the above equality 
+ (Ajz: + Biti + Digi)’ Xi41Qi41 (Aiki + Bit; + Digi) 
= 8] (Qi — Xia): — 7G] Vigi + (Gi — UF)T 
(Ri + BY Xi410i41Bi)(G; — UF) 
+ (Aii + Dig) Xi4 141 Ls (Ait; + Did), 
where 
Li : = I — B;i(Ri + BY Xi410i41 Bi) BY Xin Qiga 
D p-lp —1 y-1 D. p-lpt\—t 
= 1- BiR; BY (04), X4 + Bik; BT) i 
= (Xiu) XA -y Yi + BiR BI", 

in which the second equality is determined based on the 
following matrix inversion equality, 

CD(A + BCD)" = (C7! + DAB) "DA". 


Moreover, by considering 


Vii + Tl Xi Niyi — T] X:N. 





Gi+ı := (Xai zs pm orn ZE BiR BI ', (A.5) 
©; := Ri + BY X11 Bi, l 
we have 
a= -7t BIT X; Qiı (Aizti + Digi 
É ; Bi 104416 yi) (A.6) 





= R; B] Giy (Aix; + Diy). 
Then, under the condition that UV; := y? V;— DI Gi+1 Di > 0, 
AW; = 5] (Qi — XiM,) Bi — PTT Vigi + (Gi — UF); 
(ua; — uF) + (Aiti + Digi)! Gigi (Ait: + Digi) 
= 7] (Qi — X04 + AJ Gipi FAs) Bi + (Ui — G)T®; 


(a; — U7) — (Yi — FH) Yali — GF), 
(A7) 





where 
F; : = I + Di(% V; — DIG Di) DI Gizi 
z Ga (G EY *DiV, D7) Sa 
G2 = (Vi — D] Gis D:) DI Gi Ai. 


We can show that V; > 0 for all i € No if Q; > 0. For 
this purpose, we have 


CoS CINT! = Ca(Y7' + CIN!Ca — y7’ Q) GIN 
= 02(¥,-* — PQ) ICIN + C(Y7' — yQ) CI), 





based on assumption that N > 0. Then by substituting the 
above equality into V; defined in (12), 
-1 


V; = (N+ OU -—77¥,Q)1Y;,CJ) ~, 
which indicates that V; + 0 for all i € Ne due to the 
fact that I — y~2Y,;Q + I— y~?Y;X;, and based on the 
assumption that J — y~?Y,X; = Q7 ' > 0. 

Now if Gi+ı > 0, and Gh — 7? D; V; D] = 0, by using 
the matrix inversion lemma for PDI Giyi and pt, we 
have 





g =° V DI (Gi — YO DV; 


'D}) 


and 





UA i = 2V; ‘(+4 2 DI (GAY °DiV;, “DE * DV, 7 


Therefore, based on the above equality, if Gj41 > 0, 
Gai -—y 7? DV, "DI > 0, and Q; > 0, then Y; > 0. 
We shall shortly see that Q; > 0 is enough to guarantee 
w;> 0 (see the discussion after (A.11)). Now let us sim- 
plify DiV; + DT = a; + 26; + yi, where 

S 'CINIV NCS AT 

Š-1CINTHI — 658 “OLN ICS! AT 
($; — CIN'C2) tC] N 1C, "AT 

a = Oy Ua e 20s) At 
IG, = ÅS HAT = AGL +Y; 5. “Ar, 





l l 








II 
D Dœ Be 


yi: = LV;LT = LNT!LT — LNCS CI N“117, 

Bi : = ÅS CI NTI LT. 

Moreover, Y;+ı = AS;>'AT + B, BT — LN-'LT. There- 

fore, 

Viet oe. 'AT + B BT + 248, MORN oi 
tA V8 ATEN Oss CINILI. 





(A.9) 
Now, let us simplify B,R,* BY = ni + 2¢; + éi. We have 





Ryt = R oa Ro Ps PR. 


Furthermore, 

i= BR; |B] =B2R-'BI—7~-*B.R-1P'S;'PR-'Bi, 
G = 7° Ba(R +772 PT S7! P) PTS At 

=? BR! PTS! ÂT, & := y7 ÂS PR PTS 1AT, 








Finally, considering H; =I +(BR-'BI 
Guy -y °D: V D] = Xi -yY + BiR; BI 
-q ° DV," DI = Xi -y 7? BiB] + BRB] 
= y? All; Y; AT = H; X75 = y~? ATL; tY; AT. 


— 7? Bi BY) Xin, 


Therefore, by considering 











Xi = ATXi4,H71A+Q, (A.10) 
we have 
(Guy, -y ° DV," DI) = (AX) -y ° Ally YAT) 
= Xi H +y? Mai A * ee -~7?Q 
-7° (Xi - Q)) ATX Ht 
= Xip H7 (I +y AGY AT Xiz H7). 
(A.11) 
This indicates that if 
Q; > 0, (A.12) 
then Ga - ~~? DiV,-' DI > 0. Based on (A.5), (A.12) 


is also enough to guarantee G;}ı > 0. Therefore, (A.12) 
is enough to guarantee Y; > 0 for all i € No (see the 
discussions after (A.8)). Moreover, based on (A.11), 


ANG i Fi Aj = AHX A E, 2 ATT; 1Y; ÄT) 1 ATI; 1 
= Ï TÄT X; H7 ÄN; = X: + Å T (TX: H7 1A 





— (I-y?QY) Xi); = -QÜ + Xiu = -Qi + Xi. 
(A.13) 

Therefore, based on (A.7) and (A.13), we can conclude 

AW; = (ti — U7) Oi (Gi — 7) — Gi — GF) WAG — GF), 


and based on the definition of AW; for all ¿i € Nas we 
have 
Wr(ér) — = AWS 


However, since Zp = 0 and Zp = 0, then Wo (zo) = 0. More- 
over, 


T— 
Po (xo) a yar [e] zi _ 


Wo(Zo) (A.14) 


wl wi pr (xr) + IX pip 


Wr(Zr), 


where the first inequality is based on Definition 3 and the 
second inequality is based on (A.2). Then by substitut- 
ing Wr(%r) from (A.14) into the above equality, we get 


T= T 
Sa [z] zi a ywl w;] < —po(zo) + D AW;. 
(A.15) 
Now let us simplify the control and disturbance policies. We 
have 


<S 
<S 





(Gay, -y °D: V DITA, = Xi H; 1 AQ;. (A16) 


Then based on (A.8), y; is simplified as 
J =q? V! DI Xia, H7 | AGE. 


We can also show that V D= = CI VAL + LT, Then, 


Y; = TA + Yi = (C2 + 7 LT Xip H; Å) QT. 


Moreover, 


Rs = R- BE +y? RIPT! AT, 
A, = DiC- Ñt = ÅS tY t, 
Āiūi + Dig = as (Y; tzi + CIN tyi) + LN "yi, 
and 
a Ana -yY + BR, BI = Xai 
~?(B, BT — LNT! L!) + ByR' BI — y7’ ÅS! AT 
= ae +77 LNL — y? AS; 1 AT. 
Then by defining z¥ = $- ' (Y; 'z; + CJ] N~1y;), we have 


*=—R-'pigt — R-1(BI +y? PTS! AT) 
7? AS>1 AT) 


Gtx. zi +y 25N 1ET 





(A.17) 

Part i) For the infinite horizon problem, we have to con- 

sider the stationary version of the Riccati equations (A.10) 

and (9), where their solutions are X and Y, respectively, and 

take Yo = Y. Therefore, Y, = Y for all t € No and instead 

of (A.3) and S; > 0 for all t € No and any T € N, y € Ryo 
is needed to be selected such that 


yI-YX = 0, (A.18) 
S=Y-1+CIN-1C2-—y"Q +0. (A.19) 

In addition, it is necessary to have 
?I—- BIXB,>0, (A.20) 


which guarantees H = I + (B2R~'B] — y7? Bı BT)X > 0 
following [26, Lemma 6.2]. One also needs to prove the 
necessity for the existence of X and Y (the stationary 
solutions for the Riccati equations (19) and (20)) for any 
y > yf. If Xs = ATX H; Å + Q diverges as t > 0, 
then the corresponding zero-sum minimax problem used for 
the determination of the state feedback ¢5-controller does not 
have a bounded upper value, see [22, Lemma 3.6], where 
in this case it is proved that y is less than the -gain of 
the system [22]. Furthermore, if (9) diverges as t —> oo, 
then Y! — 0 and based on (A.1), max,,er»[pr(a@r) + 
x Xrr] — +00 for a sufficiently large T € N. This 
will result in an unbounded .J* defined in (15). Now based 
on (A.15) 


J <P rY tro + Dy AW: (A.21) 


where, 


AW; : = (ui — uj )TO(u yi) Y(Yi — yi) 

in which ® and W are given in (30). Therefore, u; = už 
at all ¿€ No guarantees J <y°x}Y tro. Moreover, 
given the observability of (Q2,A) and the controllabil- 
ity of (A, Bı — LN! D31), the solutions of the Riccati 
equations (19), (20), ie. X and Y, under the condi- 
tions (A.18), (A.19) and (A.20), are positive definite. The 
proof of this argument is similar to the one presented in [22, 


Lemma 3.5]. 


i— u) (yi 


1(AgY+LN-ly,). 


Now we will show the Lyapunov stability of the control 
loop for u; = už at all i € No. For an arbitrary t € N, we 
have the following inequality based on Definition 3, 


Po(Xo) — pi(z+) < 0, 
where p(x) = —9° (£1 — T) Y t(x — T+) + Ze. More- 
over, similar to (A.14), we can conclude 
Wilz) — Wolo) = Xi AW, 
where W(t) = ZIX (I —y°?Y X) Ti+ Zi Then by 


combining these two last expressions, we get 
W, (21) — pi (2+) — Wo(ē0) +po(z0) < Xia AW;. (A.22) 


Now let us take V(x, %4) := —pi(xt) + Wilz) as the 
Lyapunov function candidate for every t € No. Then based 
on (A.22), we have 


V (£1, z4) — V (z0, Z0) < ig AW;. (A.23) 
Now if ui =U at all i € No, 
then V(x, z+) —V(x0,%0) <0 for all t © No, which 


proves that the control loop is Lyapunov stable. 

Finally, we should prove the convergence of x; to zero as 
time goes to infinity, when w = (0,0,...) and u; = uj at 
all ¿ € No. Based on (A.21), in this case 


tS oe eee EY r: 
Let us define vf := už + R~'PTa,. Then 
zp a = 2} Qa, + 227 Pus + us? Rus = cl Qa, + {7 Ruy, 


and x44, = Ar, + Bou; = Az, + Boy. Now due to 
the boundedness of J, we can conclude that Q r — 0 
and v; — 0 as time converges to infinity. Then based on 
the observability of (Q?,A) we can show that 2, — 0 
as t + oo. 

Part ii) This part is already proved above, see (A.21). 


C Proof of Lemma 3 


Let us first introduce another formula for the dynamics 
of x, and Z, besides (10) and (11). Consider Zo = 0 
and Zo = 0, then for all t € No, 
= Aiti + By, + Dibe (A.24) 


Tt+1 


Zt+1 = Zt + zd: + ûl Ry ti, = YOT Vde, (A.25) 


where Åi, Qi and 1 are as given in (12), Il; := I — y~?¥.Q 
and 

Êi = Ba +y All; "YP, D. 
Ri = R+? PG YP, V, = 


= ae + LV,, 
-HI C55 CIN"), 


(A26) 
Moreover, at all t € No, te = Ut — Ut and 
De = yt — (Ye + v) for 

tly = -K it = Lits, KH = Liu, (A.27) 
where K, = Ro PTI}, L; = Coy}, by = y7? C207 Y; P. 


We can prove the above following the same steps as the 
one for (10) and (11). 


Now when the information sets of the output generator 
and the controller follow (16) and (4), respectively, for 
a given 7 € N, then the value function defined in (A.1) 


at all transmission time-steps to the controller, i.e., when 
€=(k+1)7 € N71, follows 
W(X) = =I, TX Ep + Že, (A.28) 


for Xe = XoQ and Qg := (I — y7?Ye Xe) t, where Xe is 
an appropriate positive definite matrix determined based on 
(A.10) and Y;, Z; follow (9) and (11) (or (A.25)), respec- 
tively. Now we will determine the value functions at non- 
transmission time steps. Consider 1 = «7 € No as the last 
transmission time-step before £ = (« + 1)r. Then, based on 
Isaacs equation, we have 


max W,(Z,). 


W,(z.) = max min _ max i 
Ge-1€R° 


J ER? U,ER™? G,41ER° 
Let us denote r = £ — 1, then 
W, (Zr, tir) = m z3 T Xote + Ze 
= max (Ar En + Bolin + DyGr)' X0( Ar Ey, + Brtip + D,Gr) 


+ Zp +21 QrE, + GL Êr ûn — 9791 Vir, 


which results in 
7 =(7°V, — DIX,D,) DI X(Apz, + pûr), 
under the condition that 


yV, — DI X, D, > 0. (A.29) 


Note that if PVs DIX, D, has a negative eigenvalue, 
then W,. (Tr, Gp) is unbounded. Now let us dme = Z := 


(X;1— 7-?D,V,-!DT)—1, then by substituting ġ* into the 
cost function, it results in 





W, (Tr, Ur) = Zr + E7Qp Ey + (ur — tip) R, (ur — tip) 
+ (Å, — Bpitp + Brus)" Ee(A,2r — Brù, + rur). 
For an arbitrary r € N ao we have the following equalities, 
A, 8, — Bptip = AI; tE, = Arr, 


ürk rū, = 2, TPR PII +Z], 


fi; PRPTHI + ĝ, = QU’ =ĝ,, 
Up Rrr = —u, PIL, 2, = —u,PTE,. 
Therefore, 
W, (Er, ur) =: (Ånn F Byup) TE ¢(ApZy + B,ur) 
+ Z, +21Q,z, + ul Ru, + 2u, PIT, 
= £10,2, + 2utN,2, + ul T,ur + Zr, 
where 
O, = ATE, A, + Qr, Ñ, = BIE, A, + PT, 
T, := R, + ÊT, b, 


Moreover, for any t € Ngt, DVDS =a;+26;+%, 
where 

= AS>'C]N-1V,-1N-1C)S71AT 

= AS; 1 NE — CaS CIN) C38, t AT 

= A(Si — CI N!C2) CIN !C2S7'AT 











= AYT -7° QI- Yr- 77’Q)ST')AT 
= AI (Y; — IS +) AT = A(I7 Y; — S7')AT, 
yi: = LVLT = LN-1L’ — LNCS CI NTILT, 
Bi : = AS CINT LT. 
Therefore, 
Yiai + DiV,-1D? = ÅS + ÂT + By BT — LNTILY 


+ A(G; tY; — S7 ')AT — LNCS CINILI 


+LN L +2AS7'CI N'I = AIL +Y; AT +BB]. 
(A.30) 





Moreover, based on assumption NV > 0, and 
CaS CINT! = 03(¥, *+CIN-'C,—7-7Q) ICIN! 
=N r O OIN ka = g 


Then by substituting the above equality into Ŷ, defined 
in (A.26), 





yo = me OVER) 

which proves that V,>0 for all te Ngt, due to the 
fact that yt —7~?Q is assumed to be positive definite. 
Consequently, we can show that 


Vi = (N + Ca( 





=-1 =X; Key 2D, V; ‘pt =X; -yY 
-PÔV D] = X,* —-y?(BıB] + AI 'Y,AT) 
in which the second equality is based on (A.30). Then we 
can conclude 
0, = IL TAT(X7 t — 77° B1B] — y~? AIL Y, AT) AI" 
+ QU! = ITA (X7 — y7?’ BBI) A(I -7°Y,Q 
y?Y,Al(X7) — 2B, BT) 1A)? +I 7TQ, 








where by considering 


©, = ATX,(I — y~?B, BT X) TA +Q, (A.31) 


and Q, = (I — y~?Y,0,)71, 
6, = I1 (0, — Q)(I—y-?Y,Q — 772¥;(O, — Q)) 
+ ITQ _ MT (O, r Q)Qr T ITQ am OrQr. 


Now we will simplify N,. Based on previous equation, we 
have =A, = (X7 * — y~?BiB])-1AQ,., and 

BIS, A, = (B+ ° AN; Y, P) (X7 -7° B1 BI) AQ,. 
Moreover, 

y? PTI Y, A (X7' — y’? B BI) TAQ, + PI + = PT 
I (y ?Y,AT (Xp + — y? B BI) 1A+1I-y°Y,0,)Q 


= P (y7°Y, (O, = Q) + f= 7 Y,O,) Oy = PIO 


Therefore, 


N, = (B3(X7' —y~?B, BI) A + P™)Q,. 





(A.32) 


Now by an induction argument, we assume that at an 
arbitrary time step t, where 1 < t < £, the value function 
follows 
W1(%:, Ur) = 21012, + 2U] Niti + UT VU + Zi, 
(A.33) 
where U; = [uj] ... ut]™. Consider s = t — 1, then 
[z] Ort, + 2U] Nit, + US TU + Z| 


= max [(A,@, + Êsûs + Dsĝs)1O:(AsTs + Betis + DoGs) 








+ 2U7 N,(A,€, + Bots + Des) + UJ TU, 
+Z: + Owes +F al Reis = yol Veis]. 





Thus, under the condition that 


xV, — DIO, Ô, > 0, (A.34) 

we have 
g = (PV, — D}O,D.)~* DY (0; (Ast. + Botts) + NIU;). 
(A.35) 


It is worth mentioning that if Vz = DI 0,D, has a negative 
eigenvalue, then Ww, (Z,,U,) is unbounded (necessity of the 
condition (A.34)). Now by substituting 7* into W.(Zs, Us) 
and having 





A,:=AUz*, Qs:=QU;1, PI := PTI}, 
=, = (6,1 - y? Ô, V DIE, 
we get 





where 
O; = ds ATS As, 
— BI — 7 PI = Pir PI 
Na hens EAs s Y,;= 21 
[méala [S] t a al 
(A.36) 
in which 


Pu = Rs + BIEB, Po = N,O;'2:B,, 
Po = T, +7 ° N D.V DIE, O NF. 
We can also simplify P22 as follows 
Po = Ya — MOTINI + NO INI +y N, DV) DI 
ZO NI = Yi- NO NI + NO RO NI. 





Therefore, W, (Zs, Us) has the same form as Wi (22, U+) and 
the assumption of the induction (A.33) is correct. Then by 
considering L, = I — y~?B,B] ©, and 


O, = ATO,L,1A+Q, (A.37) 


we can show that ©, = 0,2,. Moreover, 


=r! = O7! —y7?(BıB] + Allz!Y,AT). (A38) 


Now if we assume 1 = t — 1, then 


W,(@.,%) = min [2] Oiz: + QU) Ni + US TU: + Z] 
U, ,ER™T 
= min [(A.2,+ Bū, + Dy.) O,(A,2, + Bu, + Dy.) 
U,ER™t 
+ 2U/N,(A,z, + Bu, + Dy.) + UF TU: 


ey Z T TIQ,T, T ul Rū, =~ PIV, ga], 





where Û, = [jū] UJ]. This results in 


U* = —K,(A,z, + D.J), (A.39) 

where 

Pa R, + BIO,B, BINI = BIO, 

ee NB, Te Ne | 

Moreover, let us introduce the following matrices 

R, 0 BI 2 
ne | u S and piz Ler Saat 
then 
K, = (A+ BCB™)'Bc = A“'B(C + BTATHB) 


RBI 
5 a, — NO NI) tN, Or! 
+O NI (Y — NO NI) NO7). 


tert 5.8 


We can use the following two inequalities 


(T: bop N:O; TNE) 'N,O; 1 — Y; NX; 3 








= A ter dae tL get 2 A.40 
Ə! +O NIY N Xt = X;}, ey 
where X; = ©; — NI Y7 'N,. Then 
_ R-' Br 
a Ee Gi; (A.41) 
t 


where G, = (X7 t + B,R7!B1)-!. Moreover, by replac- 


ing Û, with U* in W, (Zi, 9.), we get 
W,(z,) := max |Z, + 27Q,2, — 7 oTVi. 

GER? 

T (A,z, + Di.) G4 (ALE, + D,y.)] . 

Therefore, 
y z (VV, 7 D'G,D,) 'D'G,A,z, 
=94 V D] (Xp? + B.R; 'BI -y °D V DI) Aa, 
if WV, — DIG,D, > 0. As we showed in the proof of 


Lemma 2, this condition is equivalent to S, := xX; Dees 
y?D.V, DT + B,R7!B! > 0. Moreover, similar 
to (A.11), we can show that S, > 0 if 

yI -Y,X,>=0. (A.42) 


Therefore, 


H =y V DIST Atu 


Moreover, assuming 





X, = Ã(X Z4 —777B B] + BgR-1B])-1A+Q, (A43) 
then similar to (A.16), 

S'A, = (X34 -y7 ?BıBI+B:R7!BI) 'Ä(I—y7?Y,X,) t 
Therefore, 


y* = y? (C207 tY, AT + DAL — y7?Y,X,) t, + C2 
Üz, = (C2 +y CL X, H7 A) I — yY, X) E. 
(A.44) 
Moreover, based on (A.39) and (A.41), 


Then, 
u% = ù +, +U; = -R Ps — R! 
(BI +° PTS ÄT (H, X34 +y LNL 
— q7? ÄS t ĂT) HÄTZ! + LNT'y,), 
7 (A.45) 
where z“ = S~'(Y,-'z,+C]N~'y,). Now let us sim- 
plify Už, where based on (A.39) and (A.41) we have 


Us = -YN X; GA, + Dga). (A.46) 
Assume l=t+1, K, = Er Nn, and consider 
= A+ BCBT where based on (A.36), 
Ê; 0 ÊI z 
= S vacii B=]|- #2 CE; 
Ao aie RAT i Fea i : 


Then, we have 





É, =—-Y7!N; =— (A+ BCB)! BC À 
= ou 
— (A+ BCBT)"} K = K} + K?, 
where 
(A + BCBT)'BC = A“1B(C~' + BTAT!B) t 


Sill, ee ict | one a 

(YT, — MO NI) INO |> a 
+ BR, BI + O71 NT (T, — MOND NO +). 
Again the inequalities in (A.40) hold and by using them 


i Rene, =i 
K; a S; Ar 
Moreover, 
(A+ BCBT) = (I — A“'B(C~* + BTAT'B) IBT) AT! 


-R BIS Or NT 


_[ 1- Ry BIS; "B, -Ry BY. ‘NT 
LT; Nix; SrO NI 


-TNX "8," É 
Êy 0 

0 (fi-N O NI)! 
= b Ry‘ BIS; ' Bi) Re" | 


-TNX tS BR! A 
and finally, 


R= M 





(I — Ry'BIS ' Bi) 


Ses VUJ RIPT, 
Ba hae 


Therefore, 





z N k D-1 PT 2 1 pt 1 DT 

R= Rl R? = —R, a Bee Act BR, PI) - Ry P; 
-Y; NX; Si A Ag Behe PI) 

ı Moreover, we have 

E ‘pT = (R+y?PtI YP) PTI! = Ro PtH, 


Ê Ry P7 = A,. Furthermore, 
= XH, AI — y~?Y,X;)7}, 


and we can prove that A, — 
we can show that S, ‘A; 
therefore, 


X Sr! At = 
Then, 


Pi R;1 BI XH AI -— YX) + R-'Ptiy} 
Na Xe (AT HPV X)! 


(I—7y?Y,X))H; AU — y? YX)! 





Moreover, R; ' ÊT = R-1B] +y? R-!P1Î; 'Y,ÃT, and 
qe PML tY, ÄT X H7 AU — y7?Y, X)! 

+ RP! = R prii (YX: Q) +1 

_ y?Y:Xt) (I -y 2Y, X) 1 2: R 1PT(I — y? YX)! 

Then by substitution into (A.46), it results in 
RH(B]X H7 A+ PI)(I — y7?Y, X)! 

TMU -YXHT AU -yY X)! 

X,'G,(Az* + LN-1y,), 

where Z° = S-1(C]N-1y, + Y7 i). By following the 

same procedure and decomposing Ty 1NI , we can rewrite 

the control policy based on a state Saor as follows 








U =- 


ž T A 
Ts+1js = H; Agala: 


es it X71G,(Az*_, + LN-1y,_1), ifs=rk+1 
= Êsjs—1; otherwise, 
(A.47) 


where s € Nj. Then at every s Æ TK, us = K5%5|5, where 


K, = —R7' (BI Xea1H;'A+ PY), 


and when s = Tk, then už follows (A.45). Moreover, based 
on (A.35), when s Æ KT € Ngt, SE Nea and t= s +1, 
then 

93 =(7 V,- D18; D 


s) DIO; (As. + [Ês Or Ñ]] Us), 


and when s = £ — 1, then 


9% :=(7 V, — DIXD.) DIR (A.T, + Ê 


we 
Q 
G 
Nae 


which can be simplified based on 
(PV, — D10, D.) DIO, =y? V DIE, 
= = (97' — 7 7(B, B] + ATZ +Y, AT)) t, 
Z4, = O,L7!A(I — 77?Y,0,)7!, 
VIDI = C7 tY, AT + LT. 


Then for every s € Nf eee y* = Je +% +07. Moreover, based 
on (A.32), (A.36), 


Nea = (BIX -7° BBI) 1A + PY) Qe, 


and for every t € N‘7?, 
nix [ Ga gm At Pe 
N Opa (Onh -77?°BıB])™A 
Furthermore, when s= «T =.4, then y* is determined 


in (A.44). Now following the similar procedure as in the 
proof of Lemma 2, 


Qi. 


AW, := We(&e) — W,(2,) = Welte) — We-1 (2-1, ue-1) 
1—2 

ou 5 [Wip (Ze41, Ur41) — W: (Zt, U,)| 
t=14+1 








TT WŴ,+1(Ē41, U,+1) Aa: 


W, (2,9) + W, (2,9) — W, (E, 
= (U, — UŽ)" ®; (U, — US) Y; 


~(%- Fy N 


for Y, = [y] .--yg_a]™, Y = [y?™---yg@al" 
or F +BB, BINT, 
$ Ni41B, YTi+1 i 


and W7 := diag(W,, .,W,47-1), where for t€ Nig 
Wi: ys 24, — DI 1 Dı and y, : = VV, — DIGuiD,. 
Then by taking the summation over both sides of the above 
equation at all transmission times, we get 


Wr(@r) — Wo(Z0) = J2 AWar- 


Therefore, similar to Lemma 2, we can show that 
Pale T= 
Jio [2i 2 — —po(xo) + ozo AWer 


(A.48) 
in the following, we will prove parts i and ii. 
Part i) First we derive the conditions under which a 7- 
periodic ł2-controller exists. Based on (A.29) and (A.34), 
we should have 


Cs = Vp, = DIr Dr > 0, 


Yul wil] < 


(A.49) 


for every h¢Ni~', where according to (A.31) 
and (A.37), On+41 is determined based on (32). We know 
Ŷ, > 0 and by assumming ©); > 0, for all h € Ni: we 
have the following equality based on the matrix inversion 
lemma 


Cai = ae + a *V Di En Dn V; t, 








where ZEp}1 = (me —y 2D, V; 1 T)=1, This shows that 
Cea is positive definite if =), 41 is positive definite, assuming 
that O74: > 0. Furthermore, again by using the matrix in- 
version lemma and assumming ©),41 > 0, for all h € NIT D 
we can rewrite =p41 as follows 


Zr+1 = Onga + Onti DhChy D Onyi. 


This shows that =p,41 is positive definite if Cs ag. >, 
assuming that Ons > 0. Therefore, the condition (A.49) is 
equivalent to Ziri > 0, for every h E€ NĪ] `, if On41 > 0. 
Then according to (A.38), we should have 


Z = Op — y (AT, Yn AT + Bi BI) > 0, 
gal 


for all h € N 1. On the other hand, Sa = Rack 
y~?Yn41. We can easily conclude that All, ly, AT + 





(A.50) 


B, BI > > Yh+1- Therefore, satisfiying (A.50) will automat- 
ically satisfy O,41 > 0, for all h € NȚ`+. Moreover, for 
the infinite-horizon problem, the Riccati equations (A.43) 
and (9) converge to stationary positive definite matrices X 
and Y, respectively, for which the conditions (A.18), (A.19), 
and (A.20) are required, following the similar reasoning as 
in Lemma 2. 
Then according to (A.51), we have 


J E poe, [2] ži z “ayy mG + pape AW,wr, 
(A.51) 
where 


AW... =(U,—U*)T® 


yw! wi] <7 


(U, —-U¥)-(Y.- Y>) %,(Y,-Y;), 
in which U* and Y* are the stationary version of the control 
and output policies, respectively, which are introduced in 
the Lemma. Now for U, = U¥ at all « € No, we can easily 
conclude J < 722) Y~' zo. 

Global Asymptotic stability: Similar to Lemma 2, having 
Q, = (I —-77?Y X) t, War ltee = T1, ce + Zur 
and Dae hae) = — (Ler Zi Big Y (Tks = Zur) + Lars 
then 


Wer(Znr) Wo(Zo) + po(zo) < pia AW. 

(A.52) 
Now we take V (£ur, Tur) := —Prr (£ur) + War (Env) as the 
Lyapunov function candidate for every x € No. Then based 


on (A.52), we have 


AV (£ur, pr) = V (Eur, Eur) —V (£0, Z0) < Dy AW. 
(A.53) 

Now if U, = Už at all oe € No, then AV (2x7, Zur) < 0 for 

all x € No, which proves that the control loop is Lyapunov 

stable. Following the similar arguments as in the proof of 

Lemma 2, we can show that x, — 0 as time goes to infinity, 

when w = (0,0,...) and U, = UF at all ve No. 

Part ii) This is already proved in (A.51). 


— Pkr (Gig) z 


D Proof of Theorem 1 
Based on (28), to show that the scheduling and control 
policy (47), (43) is an 2-controller with an £2-gain bound y, 
firstly, we have to show that for any realization of y+, t € No, 
DORE- up) O(RE — w) - 


t=0 


(ye — yt) Vy — ue) <0, 


(A.54) 
where at every t € No, RY follows (43) and už, yř fol- 
low (22), (29), respectively. According to the scheduling 
law (47), for every t € N, 


t 

Birit =X (Rii TU (Riu; Hyi)" Bly) < 0. 
i=0 

Therefore, (A.54) also holds. On the other hand, based 

on (28) and (A.54), we conclude J < 7725 Y~ tzo. 

Now we need to prove the Lyapunov stability 
of the proposed event-triggered control policy. The 
inequality (A.23) also holds in this setting for 
al teN and Sa AW; = Bte <0. This results 


in V(a,,%:) -V(a#o,%o) <0 for all t¢€No, which 
guarantees the Lyapunov stability of the control loop. 
Moreover, similar to the proof of Lemma 2, we can show 
that for w = (0,0,...) and the proposed ETC, the state 
converges to zero as time goes to infinity. Therefore, the 
control loop is globally asymptotically stable. 


E Proof of Theorem 2 


The proof is similar to what we presented in the proof 
of Theorem 1, however, we just need to consider (35) 
and (A.53). 
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Complementary Material: Proof of Lemma 1 


To solve the optimization problem in (6), we fol- 
low a dynamic programming approach, starting from the 
initial time and moving forward. By assumption, we 
have po(xo) = -4 x] Yo ‘vo. Now to find pi(a1), con- 
sider x := 21, Ç := Xo, Y := Yo, U := Uo and w := wo. Then, 


pi(a) = max max {po(¢) + 6TQ¢ + uT Ru + 2¢7 Pu 


— wtw; y= C+ Daw, x = AC + Biw + Bou}. 











Based on the dynamics of y, we can parameterize w as 


w = D} N+ (y — C20) + Dy, (A.55) 


for an arbitrary v € R? and D := I — D3, N`! D1. Then 
we can substitute the above equality into the state dynamics, 
which results in 





x = AC + LN! (y — C2¢) + BiıDv + Bou. (A.56) 


Moreover, we have 


Bol u, y, v) : = T(V YS" — Q) + uT Ru + 27 Pu 
=y wtw = =° CT SoG + 27°C (CI N'y +77? Pu) 
+u Ru- v7 Dv — °y N'y, 


where So := Yo | + C}N~1!Cy — y~?Q. Assuming So to be 
invertible, then 


Bol, u, y, v) = =° (C — C) Sol — ¢*) + uT Ru 
— UT Dy — yy Ny + eS, 
(A.57) 
for ¢* := Sy'(C]N-!y+~7-?Pu). Moreover, based on 
(A.56), 





¢ =Atb+ (I — AT Â), (A.58) 


where Ai is the Moore-Penrose pseudo-inverse of Ay €€R” 
is an arbitrary vector, and 
b = z — Bou— LN7'y— Bı Dv. 
Then, by substituting (A.58) into (A.57), we get 
Xol, u, y, v) T -7 (Atb F (I =z At Â)E zi ¢*) T So 
(Âtb + (I — Ât Â) — ¢*) = 7T Dv 
+uTRu Say N'y + PT Soc. 
Now the maximization of 89(¢,u, y, v) with respect to ¢ is 
equivalent to the maximization of xo(§, u, y, v) with respect 


to € € R”. First, let us replace ¢* by At AC* + (I — Ât Â) 
in the above equation 


Xolé, u, y, v) = —7? (Â' (b — Â¢*) + (I a Ât Â)(E = ¢*)) So 


(At(b— Â¢*) + (I — Ât Â) (E — ¢*)) + uT Ru 
— yT N y — y?r Dv +y? CT* SoC*. 
Since A(I — Ât Â) = 0, if So > 0, then 


arg MaXgeRr xo(€, u, Yy, v) =¢, 


and 

kolz, u, Y, V) := maxeern Xol, u, y, v) 
= —7° (x — Bazu — LN™'y — Bı Dv — Â¢*)" At So At 
(x — Bu — LN ` ty — Bı Dv A¢*) yv Dv 
+u Ru- y?y N ty + yC Soc*. 





Note that maxecrn Xo(é, u, y, V) is unbounded if So has a 

negative eigenvalue. Now to determine p;(x), we just need 

to solve the following optimization problem 
Pı (x) = Maxyerd Ko (x, u, Y, v). (A.59) 


By taking the derivative of Ko(z,u,y,v) with respect to v 
and equating it to zero, we get 


D(Qov* — c) = 0, 

where 

Qo = I + BI ÂT So At B,D, 

c = BT ÂT So At (x — Byu— LN ty — AC*), 

v* = arg max,epa kolz, u, y, v). 
We know that aDD],N -1 = 0, for any a € R, therefore, 

Qov* — c = aD], N™'. 
Accordingly, assuming a = 0, one value for v* is 
y* :=05 BT AIT So Al (x = Byun — LN-1y = Ac’), 


given the condition that Qo > 0. By using the matrix inver- 
sion lemma and considering 

Yı := ASp 1 AT + B, DBI, (A.60) 
then under the condition that Sp > 0 and Yı > 0, we have 


v* := BIY! (æ — Buu — LN71y — AC’). 


Moreover, by using the matrix inversion lemma, we have 
051 =I — DB] (AS;1A™ + B,DBT)-1By. 


The above equality indicates that Sp > 0 is enough to 
guarantee Qo > 0. Therefore, if Sg > 0, then 


pi(z) = ko(x,u,y,v*) = —7?(a — Bau — LNT ty — Ac*)t 
Att S, At (I — B, DBTY,-') (x — Bou — LN~1y — Ac*) 
+u Ru — yT N ty + yC Sot. 
Accordingly, by using (A.60), we can conclude 
Alt Sy At (I — B,DBTY,') = Yr*, 
which finally simplifies pı (x) as follows 
pila) = =° (2 — 21) 1Y (£ — 31) + Z1, 
under the condition that Sọ > 0, and Yı > 0, where 
zı := (ÂSF!C] + L)N~1y + (Ba +77’ AS9 1 P)u, 
Zı :=u (R+ ° PTS P)u +2y1N-'CS7'Pu 
— yyT NHI — C287 CINT })y. 
We can also show that the controllability of (Â, B,D) 
and So > O are sufficient conditions to guarantee Yı > 0. 
Based on (A.60), we can infer that Y; is at least positive 
semi-definite when Sp > 0. Now let us take 9 € R” asa 


nonzero eigenvector of Y, corresponding to its possible zero 
eigenvalue. Then based on (A.60), we have 


oA=0, oT B,D=0. 
This result in 
o'[B,D AB,D...A"~1B,D] =0. 


Therefore, the controllability matrix for (A, B,D) does not 
have n independent rows, which contradicts the controllabil- 
ity assumption. Thus, Y; is positive definite. 

Now, based on an induction argument, we assume that at 


an arbitrary time t € No, 
pilte) = —77(ae — Fe) TY, (te — T) + Ze, (A61) 


where Z; is a known state estimation. Then in order to find 
the information state at time t + 1, we need to solve 


pe+i(@) = max max{p;(¢) + TQ + ul Ru + 2¢7 Pu 
weRé CER” 


—yuwtw; y= CC + Daw, «= AC + Biw + Bou}, 





where £ := %141, C:= Zt, Y= Yt, U= Up, W = Uz. We 
follow the same steps as before and firstly substitute the 
disturbance vector (A.55) into the optimization cost function. 
Therefore, assuming ĉ := z+, we have 


Bilku, y, v) = —97(C — B)TY T (C — 2) + Z +CT 
+uT Rut 2¢7Pu—ywtw = —77(¢TS,¢ — 2¢T Re) 
+ Zi +u Ru — vt Dv — ° (YTN ty + &Ty, tê), 








where 
S: := Y; + CINC- 7°Q, 
Ri := CIN y +Y; tê +y’? Pu. 





We assume S, is invertible, and complete the squares for Ç 
as 


Bil, Us y, v) ai 
+u Ru -9° Dv — 7 


=P (E= ETSE C) PETS + Zi 

(WTN y + 8TY, +2), 

where ¢* := S; 'R,. Then by substituting (A.58) into the 

above equation, doing maximization with respect to € and 

rewriting it based on x, we get 

klz, u, y, v) := —7?(@ — Bou — LN~'y — Bı Dv — ÂG)" 
Att, At (x — Bəu — LN ` ty — Bı Dv Â¢*) yuv Dv 
+T" + Z + ul Ru — y (yT N y + êTY +â), 

under the condition that S; > 0 (if S; has a negative eigen- 


value, then the maximization with respect to € does not have 
a solution). Finally, 





peti (x) = maxyepa Kilt, U, y, V), (A.62) 


which results in 
v* = Q BI Â S Al (2 — Bou — LN~+y — AC‘), 


where Q, = J + BY Atts, At B,D, given the condition that 
Q > 0 (which is equivalent to S; > 0 following the similar 
steps as before). Therefore, py4i(v) = Ke(x,u,y,v*). By 
following the similar simplification approach as before, and 
changing £, Ç, y, U, È to %441, Tt, Yt, Ut, Tt, respectively, we 
get 


pesa(@ep1) = 9 (ety — Bea) Vi (wep — Zet) + Zen, 


under the condition that S; > 0 and Yi41 > 0 (which holds 
based on the controllability of (A, B,D) and S; > 0), where 


Yı = AS71AT + By DBT, (A.63) 
and 
Ziyi = Ze +u] (R+ ° PTS P)us + 2y] N-'C2S7 1 Pur 
a o 1(I — CS7 ‘Ol No )ye + 2U! PTS, “Y, 7 2e 
= yP lY (I — S'Y Et +29 YIN CS Y, T. 


Furthermore, the state estimator is 
= Az, + Bout + LN ty; 
C2Z,)). 


Since the information state at time t + 1 has the same 
form as the one assumed at time t, then the assump- 
tion of the induction is true and the information state 
follows (A.61) if S > 0 for every t € No, where Y, fol- 
lows (A.63) and Z;, Z, are determined based on the previous 
two equation for Zo = 0, Zo = 0. 

In order to simplify the future analysis, we can 
rewrite Z,4;, in such a way that no coupling terms 
are present. We carry out this first for the coupling 
term between uv and u,. Consider R; := R + y? PTS 1p 
and $; = Yp! + CJ N-1C, — 7~?Q, then 


= AC* + Bou, + LN~'y, 
+ ÂS (QT: +? Pur + CIN (y: — 


Tt+1 


ti, : = -R PIS CIN ty = -R PTS CIN y 


Therefore, 

Zis1 = = Zi + (up — tie) Re (uy — ùt) + (ey — U) PIS Y; Ti 

— yl NHI — CS7 CIN ye — ùl Rut, + 20] PTSP TY, + 
-PIY (I — Sp Y Et + 2P YIN OS; Y; Te. 


(A.64) 
Now by substituting tu; into the above equation, consider- 
ing û, := uz — ù and noticing that 
NCS PR PIS CIN. baa N 
— I) =? NHI — C287 CI NT!), 
— NICS PRIPIS Yt +y? NICS Y! 
SN O56, Yt, 





-1 (CzS7 CINT! 





we can simplify 2:41 as follows 


Zin = Zi + A Rite — y Vie — PEEVE (I — Sp Yp E 
+ 207 PTS Y T, + 27y N OS Y, T, 
where 7 
V, := NHI — CaS CI NT}). (A.65) 
Then by considering u, := tig — Ùt, Yt := Yt — Ye, We get 





Zig = Z + Uf Rete — VP UTV Ge + TE QT, 
where 
Ü = Sh eee ea = RE PI IE E 
je = VIN OS Y "T, = Coll; ai, 


for Il, := I — y~2Y,Q. Moreover, 

Qe PMS YE! Ye") Ye PR PT Y 
+7°Y, SCI N OÑ) = QÅ. 

Finally, we can represent the state estimator as follows 

B41 = AX, + Bot, + (L + (Â — BoR7'P™)S~'CJ)N-ly 

+ ÅST! (Y7? QT +7? Pi, — CIN71C2%) = AZ, + Bot, 

+ Boi, + (L + ÅŠT!CI)N t + (L + AS“!C7) N71, 

+ ÅST (V72 QT; +77? Pū, +77? Pit, — CIN71C2%) 

= Apt, + Bitu + Dije, 

















A — All; Bı = Bə +72 ÂST!P, and 


-(Â-BR1P1T) S7 CI) NT! = AS OI N-1+LY%,. 








